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(54) Method and system for addressing audio-visual content fragments 



(57) A process (900) is disclosed for applying an 
audio-visual addressing scheme to locate fragments of 
audio-visual content. The process (900) comprises a 
definition sub-process (902), an encoding sub-process 
(904), and a decoding sub-process (906). The definition 
sub-process (902) defines a logical model (908) for a 
class of audio-visual media. The logical model (908) is 
used in the encoding sub-process (904) for encoding an 
address for an audio-visual resource fragment belong- 
ing to the class of audio-visual media. The logical model 
(908) is also used for locating the audio-visual resource 
fragment that is associated with the address. 
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Description 

Copyright Notice 

5 [0001] This patent specification contains material that is subject to copyright protection. The copyright owner has 
no objection to the reproduction of this patent specification or related materials from associated patent office files for 
the purposes of review, but otherwise reserves all copyright whatsoever. 

[0002] The present invention relates generally to retrieval of data from data bases, and in particular, to retrieval of 
audio-visual data. 

10 [0003] The advent of technology providing mass-market access to the Internet places vast amounts of on-line infor- 
mation within relatively easy reach. The World Wide Web (WWW) (hereunder, the Web) underpins much of the growth 
of Internet use, particularly because of the ease of use, and also due to the intuitive user interface presented by Web 
browsers. Universal Resource Indicators (URIs) are a ubiquitous addressing feature used to locate target resources in 
the Web context. This is particularly relevant when Web pages are used in conjunction with a Common Gateway Inter- 

15 face (CGI) scripting application, which allows the Web page to become, in essence, the front end of a myriad of data- 
bases accessible over the Internet. 

[0004] Notwithstanding the explosive progress described however, a Web user is, in most cases, unable to "drill 
down" beyond a certain level of data, and must, in many cases, down-load an inconveniently large and cumbersome 
amount of information in order to locate useful information. Illustrating this fact, consider investigating all flights from 

20 London to Moscow departing from Heathrow airport on a given date. In order to make a selection based on a number 
of criteria such as departure time, airline, number of stops and so on, a long list of flights typically needs to be down- 
loaded and scanned, either manually or using a back-end application on a local personal computer (PC). 
[0005] Further exemplifying the problem, certain types of data such as, for example, audio-visual (AV) data, typi- 
cally manifest themselves as monolithic blocks of information. The internal structure of such data, whether it be a par- 

25 ticular video segment, or fragment, in a movie, or a specific movement in a symphony, is neither visible, nor addressible, 
or consequently accessible in terms of fragments. 

[0006] Taking a more extreme example, off-line audio-visual data, in the form of celluloid film archives, paper-based 
libraries, and a wealth of other sources, are also not addressable, and are thus invisible and inaccessible at the "frag- 
ment" level. Although particular books can be located, by call number and location in a library, specific chapters thereof 

30 are not visible or addressable, and consequently, not accessible. 

[0007] Extensible Markup Language (XML) provides a drill down capability for a limited sub-set of on-line informa- 
tion, namely information which is coded in XML. However, useful as this may be, the overwhelming bulk of available 
information has been produced in other programming formats such as Hypertext Markup Language (HTML), or alter- 
natively, is in hard copy form in physical archives and libraries. The aforementioned types of information are referred to 

35 as "legacy" information. 

[0008] It is an object of the present invention to substantially overcome, or at least ameliorate, one or more disad- 
vantages of existing arrangements. 

[0009] It is an object of the present invention to substantially overcome, or at least ameliorate, one or more disad- 
vantages of existing arrangements. 
40 [0010] According to a first aspect of the invention, there is provided a method for addressing an AV fragment of an 
AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said method comprising 
steps of: 

45 determining a URI network address for the AV resource; 

applying the logical model to the AV resource to form a hierarchical representation of the AV resource including a 
representation of the AV fragment; 

determining a fragment identifier for the fragment dependent upon the representation of the AV fragment; and 
combining the URI network address and the fragment identifier to form a URI reference, being an address for the 
so AV fragment. 

[0011] According to a further aspect of the invention, there is provided a method for locating an AV fragment of an 
AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said method comprising 
55 steps of: 

using a URI network address portion of a URI reference to locate the AV resource; 

identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one of (a) the fragment identifier, 
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(b) the URI, and (c) the fragment identifier and the URI; and 

applying an XPath based addressing scheme to the fragment identifier, said scheme including at least one of a time 
axis, a time function, a region axis, and a region function, for addressing temporal and spatial fragments of the AV 
resource, thereby locating the AV fragment. 

5 

[0012] According to a further aspect of the invention, there is provided a method for addressing an AV fragment of 
an AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said method comprising 
steps of: 

w 

determining a URI network address for the AV resource; 

applying the logical model to the AV resource to form a hierarchical representation of the AV resource including a 
representation of the AV fragment; 

determining a fragment identifier for the fragment dependent upon the representation of the AV fragment, including 
75 a sub-step of identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one of (a) the frag- 
ment identifier, (b) the URI, and (iii) the fragment identifier and the URI; wherein said identifying step comprises a 
sub-step of applying an XPath based addressing scheme to the fragment identifier, said scheme including at least 
one of a time axis, a time function, a region axis, and a region function, for addressing temporal and spatial frag- 
ments of the AV resource; and 

20 combining the URI network address and the fragment identifier to form a URI reference, being an address for the 
AV fragment. 

[001 3] According to a further aspect of the invention, there is provided an apparatus for addressing an AV fragment 
of an AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
25 resources, wherein a logical model is associated with members of the class of AV resources; said apparatus compris- 
ing: 

first determining means for determining a URI network address for the AV resource; 

applying means for applying the logical model to the AV resource to form a hierarchical representation of the :AV 
30 resource including a representation of the AV fragment; 

second determining means for determining a fragment identifier for the AV fragment dependent upon the represen- 
tation of the AV fragment; and 

combining means for combining the URI network address and the fragment identifier to form a URI reference, being 
an address for the AV fragment. 

35 

[0014] According to a further aspect of the invention, there is provided an apparatus for addressing an AV fragment 
of an AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said apparatus compris- 
ing: 

40 

first determining means for determining a URI network address for the AV resource; 

first applying means for applying the logical model to the AV resource to form a hierarchical representation of the 
AV resource including a representation of the AV fragment; 

second determining means for determining a fragment identifier for the fragment dependent upon the representa- 
45 tion of the AV fragment, including identifying means for identifying (i) a type of the AV resource, and (ii) the logical 
model, dependent upon one of (a) the fragment identifier, (b) the URI, and (iii) the fragment identifier and the URI; 
wherein said identifying means comprises second applying means for applying an XPath based addressing 
scheme to the fragment identifier, said scheme including at least one of a time axis, a time function, a region axis, 
and a region function, for addressing temporal and spatial fragments of the AV resource; and 
so combining means for combining the URI network address and the fragment identifier to form a URI reference, being 
an address for the AV fragment. 

[001 5] According to a further aspect of the invention, there is provided an apparatus for locating an AV fragment of 
an AV resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
55 resources, wherein a logical model is associated with members of the class of AV resources; said apparatus compris- 
ing: 

utilisation means for using a URI network address portion of a URI reference to locate the AV resource; 
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identifying means for identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one of (a) 
the fragment identifier, (b) the URI, and (c) the fragment identifier and the URI ; and 

applying means for applying an XPath based addressing scheme to the fragment identifier, said scheme including 
at least one of a time axis, a time function, a region axis, and a region function, for addressing temporal and spatial 
5 fragments of the AV resource, thereby locating the AV fragment. 

[0016] According to a further aspect of the invention, there is provided a computer readable memory medium for 
storing a program for apparatus for addressing an AV fragment of an AV resource over a network to an arbitrary level 
of resolution, said AV resource being a member of a class of AV resources, wherein a logical model is associated with 
w members of the class of AV resources; said program comprising: 

code for a first determining step for determining a URI network address for the AV resource; 
code for an applying step for applying the logical model to the AV resource to form a hierarchical representation of 
the AV resource including a representation of the AV fragment; 
is code for a second determining step for determining a fragment identifier for the AV fragment dependent upon the 
representation of the AV fragment; and 

code for a combining step for combining the URI network address and the fragment identifier to form a URI refer- 
ence. 

20 [0017] According to a further aspect of the invention, there is provided a computer readable memory medium for 
storing a program for apparatus for addressing an AV fragment of an AV resource over a network, to an arbitrary level 
of resolution, said AV resource being a member of a class of AV resources, wherein a logical model is associated with 
members of the class of AV resources; said program comprising: 

25 code for a first determining step for determining a URI network address for the AV resource; 

code for a first applying step for applying the logical model to the AV resource to form a hierarchical representation 
of the AV resource including a representation of the AV fragment; 

code for a second determining step for determining a fragment identifier for the fragment dependent upon the rep- 
resentation of the AV fragment, including code for an identifying step for identifying (i) a type of the AV resource, 
30 and (ii) the logical model, dependent upon one of (a) the fragment identifier, (b) the URI, and (iii) the fragment iden- 

tifier and the URI; wherein said code for the identifying step comprises code for a second applying step for applying 
an XPath based addressing scheme to the fragment identifier, said scheme including at least one of a time axis, a 
time function, a region axis, and a region function, for addressing temporal and spatial fragments of the AV 
resource; and 

35 code for a combining step for combining the URI network address and the fragment identifier to form a URI refer- 

ence, being an address for the AV fragment. 

[0018] According to a further aspect of the invention, there is provided a computer readable memory medium for 
storing a program for apparatus for locating an AV fragment of an AV resource over a network, to an arbitrary level of 
40 resolution, said AV resource being a member of a class of AV resources, wherein a logical model is associated with 
members of the class of AV resources; said program comprising: 

code for a utilisation step for using a URI network address portion of a URI reference to locate the AV resource; 
code for an identifying step for identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon 
45 one of (a) the fragment identifier, (b) the URI, and (c) the fragment identifier and the URI ; and 

code for an applying step for applying an XPath based addressing scheme to the fragment identifier, said scheme 
including at least one of a time axis, a time function, a region axis, and a region function, for addressing temporal 
and spatial fragments of the AV resource, thereby locating the AV fragment. 

so [0019] Various aspects of the prior art, and a preferred embodiment of the present invention will now be described 
by way of example only with reference to the drawings, in which: 

Fig. 1 depicts a prior art system for accessing audio data on a CD ROM using the Internet; 
Fig. 2 illustrates indexing typically provided for a CD ROM according to Fig. 1 ; 
55 Fig. 3 depicts a preferred embodiment of the addressing method in relation to CD ROMs according to the present 

invention; 

Fig. 4 shows a process for applying an audio-visual addressing scheme to locate fragments of audio-visual content; 
Fig. 5 illustrates application of the method in Fig. 3 to addressing a fragment of audio data on a CD ROM; 
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Fig. 6 depicts the preferred embodiment applied to addressing a fragment of digital video content on a CD ROM; 
Fig. 7 shows a process for locating audio-visual fragments using the "avptr" addressing scheme; 
Fig. 8 shows a process for evaluating a location step in Fig. 7; 
Fig. 9 shows an exemplary process for locating video data on a digital video tape; 
5 Fig. 10 shows an exemplary process for locating visual data in a region; 

Fig. 1 1 shows an exemplary process for locating audio data on an audio CD; 
Fig. 12 depicts the locating of resources using conventional URIs; 

Fig. 13 illustrates use of extended URIs for fragment location according to the preferred embodiment; and 
Fig. 14 is a schematic block diagram of a general purpose computer upon which the preferred embodiment of the 
~w present invention can be practiced. 

[0020] Where reference is made in any one or more of the accompanying drawings to steps and/or features, which 
have the same reference numerals, those steps and/or features have for the purposes of this description the same func- 
tion^) or operation(s), unless the contrary intention appears. 

75 [0021 ] It is noted that the introductory part of the description makes reference, for illustrative purposes, to audio and 
video content which is stored on Compact Disk Read Only Memory (CD ROM) media accessed by a "Juke Box" device 
which is capable of storing a number of such disks and accessing them according to an address. 
[0022] Fig. 1 depicts a prior art system used to locate an audio content CD ROM 312 using the Internet 308 as a 
vehicle. A user (not shown) uses a personal computer (PC) 304 which is connected to the Internet 308 in order to con- 

20 nect to the server 306 of an on-line music provider. The server 306 is connected to a CD ROM juke box 310 which 
houses a plurality of CD ROMs 31 2, 31 6. Each CD ROM 312, 31 6 contains individual songs exemplified, for illustrative 
purposes only, by bold lines 314, 318 respectively. The user has a paper description 300 of the desired CD ROM 312 
containing a title 326 of the CD ROM, and also a list of the songs 302, 320. The user uses a Universal Resource Indi- 
cator (URI) 324 which "points" to the address of the CD ROM 31 2, and the user is able to download music from the CD 

25 ROM 31 2 over the system. c\ x 

[0023] In Fig. 2, the CD ROM 312 can be portrayed in a description 400 as containing a list of songs 402, 404 under 
a title 41 4, where the song 404 has indices 41 0 and 41 2 which point to particular segments within the song 404. Termi- 
nology such as "songs" is used for illustration in this part of the description, noting that in fact, as described, the audio *^ 
content is actually stored on CD ROM as noted. For example in classical music where the "song" 402 can be an indi- • 

30 vidual movement of a symphony, and therefore can be quite long, the index 1 (ie 41 0) can point to a trumpet solo, and \ 
the index r (ie 412) can point to a violin solo of interest. Depending upon the capabilities of the server 306 and juke box ;!r 
310 in Fig. 1 , the user can address the desired CD ROM 312, and address a desired index 410. It is noted however, 
with reference to both Fig. 1 , and to Fig. 2, that the user is limited to addressing, and so accessing, material only down 
to the level of the particular CD (ie 312), or perhaps the specified predefined index (ie 410). It is not possible to "drill 

35 down" to an arbitrary further specified level of fine grain detail. 

[0024] Fig. 3 depicts an illustrative embodiment of an addressing method, in this case to be used in relation to audio 
CD ROMs. The CD ROM 312, formerly described by the description 400 which contains a list of individual songs 402, 
each of which may contain a level of indexing (eg 410 see Fig. 2) is extended, using a logical model based upon con- 
secutive time blocks or slices, and using a set of rules defining how the logical model is to be applied, into a hierarchical 

40 representation comprising both the description 400 and the further description 500 comprising time blocks 502 to 512. 
The logical model, when applied to the CD ROM 312, serves to form a hierarchical representation of the otherwise mon- 
olithic AV content of the CD ROM 31 2. The model thereby enables systematic and rapid addressing of arbitrary content 
fragments on a time block basis, and provides the desired arbitrary drill down capability. Using the described represen- 
. tation, a user is able, for example, to select an arbitrary fragment of audio content on the CD ROM 312 by specifying a 

45 fragment address, or fragment identifier, of the form Title / songl / block 2 - block j-3", where j is an arbitrary index as 
shown in Fig. 3. The present logical model is used for illustrative purposes, and more advantageous logical models and 
addressing schemes are proposed later in the description. In Fig. 3 song 402 is shown to comprise blocks 502 to 506, 
song 2 comprising blocks from the block after 504 through to block 506 and so on. 

[0025] Fig. 4 shows a process 900 for applying an audio-visual (ie AV) addressing scheme to locate fragments of 
so audio-visual content. The process 900 comprises three sub-processes, namely a definition sub-process 902, an encod- 
ing sub-process 904, and a decoding sub-process 906. The definition sub-process 902 defines, as depicted by an 
arrow 910, a logical model 908 for a class of AV media. After the definition sub-process 902, the logical model 908 is 
used (as depicted by an arrow 912) in the encoding sub-process 904 for encoding an address for an audio-visual 
resource fragment belonging to the class of AV media. The logical model 908 is also used (as depicted by an arrow 91 4) 
55 for locating the audio-visual resource fragment that is associated with the address. 

[0026] Turning to Fig. 5, the hierarchical representation 602, comprising descriptions 400 and 500, is shown in a 
system content in more detail. By using the fragment identifier 610, derived from the hierarchical representation 602, in 
conjunction with the URI 324 (see Fig. 1), an extended URI 606 (commonly referred to as a "URI reference" is shown 
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to incorporate both the URI 324 described in relation to Fig. 1, and an additional fragment identifier 608. The URI ref- 
erence can thus be used as an address to the CD 312, and further, to the desired fragment 314. 
[0027] Fig. 6 depicts another hierarchical representation 706, determined using a logical model appropriate for dig- 
ital video. In this example, a sequence of digital video shots 700 is recorded on a CD ROM 724. The logical model 

5 selected resolves the video sequence 700 into frames eg 708, each frame being further resolved into x intervals eg 71 0 
and y intervals eg 722. This logical model is used for illustrative purposes, and more advantageous logical models are 
proposed later in the description. Using the described representation, a user is able, for example, to select an arbitrary 
spatial fragment of video content on a specified frame of the CD ROM 724 by specifying a fragment address, or frag- 
ment identifier, of the form "Title / framel / x1 - x2; y1 - y2 M . The x interval from x1 (726) to x2 (728) and the y interval 

10 from y1 (730) to y2 (732) address the spatial region 704 within the frame 702 in the set of digital video shots 700. The 
URI reference 716 therefore contains a portion 734 prior to the hash sign 720 which addresses the digital video disc 
724, while the portion 736 after the hash sign 720 addresses the fragment 704. 

[0028] Having provided an illustrative description of an embodiment of the invention, a more detailed description is 
now provided. XML is utilised as a basis for describing a preferred embodiment of the present invention. This is both 
15 from the standpoint of conceptual and notational convenience, and also because XML has significant support as a rec- 
ommendation in the context of the World Wide Web Consortium (W 3 C). 

[0029] It is shown, in the preferred embodiment, how the XML Path Language (XPath), can be extended and used 
in an unexpected manner in order to locate fragments of non XML-based audio-visual content. 

[0030] The XML Linking Language (Xlink) uses URI's for locating objects. In principle, modified URIs can be used 
20 for locating any resource that has identity, for instance, an electronic document, an image, a service, a collection of 
other resources, a person, an corporation, or a bound book in a library. Each resource corresponds to an entity or set 
of entities in a conceptual model. URI's can therefore be used for locating or referencing resources other than XML doc- 
uments. However, the XPath and XML Pointer Langauge (Xpointer) schemes that XL[nk currently uses for addressing 
the internal structure of data objects can only be used to locate fragments of XML documents. 
25 [0031] As an introduction, the use of XLink, XPointer, and XPath, are considered in the limited context of XML doc- 
uments. XPath models an XML document as a tree of nodes. There are seven types of nodes, namely root nodes, ele- 
ment nodes, text nodes, attribute nodes, namespace nodes, processing instruction nodes and comment nodes. XPath 
uses a compact, non-XML syntax to facilitate the use of XPath within URI's. An XPath location path consists of a '/-sep- 
arated list of location steps. Each location step has the form: 

30 

axis :: node-test [predicates] 

where axis specifies the tree relationship between the nodes selected by the location step and the context 
node; node-test specifies the node type or the name; and predicates refine the set of nodes selected by the 
35 location step. 

[0032] A number of syntactic abbreviations allow common cases to be expressed concisely as follows: 

@ is short for attribute::, e.g. attribute: :type can be abbreviated as ©type, 
40 II is short for /descendant-or-self::node(y, 
. is short for self: :node(), and 
.. is short for parent::node(). 

[0033] An axis specifies the tree relationship between the nodes selected by the location step and the context node. 
45 XPath axes include child, parent, descendant, ancestor, following-sibling, preceding-sibling, following, preceding, 
attribute, namespace, self, descendant-or-self and ancestor-or-self. The default is the child axis. XPointer extends 
XPath adding the string and range axes. 

[0034] A node test specifies the node type or the name (such as the name of an element or an attribute) of the 
nodes selected by the location step. 
so [0035] There can be zero or more predicates for refining the set of nodes selected by the location step. Predicates 
are evaluated for each candidate location along the specified axis, and typically test the element type, attributes, posi- 
tions, and/or other properties of the candidate nodes. 

[0036] A function library provides a set of predicate functions such as count(), position(), id(), last(), etc. Each func- 
tion takes zero or more arguments and returns a single result. Like XPointer, a new scheme can define new functions 
55 to extend the core functions of XPath. 

[0037] Each location step is evaluated with respect to a context. The context is initially the document root, or more 
generally the results of a prior location step. The node set selected by the location step is the node set that results from 
generating an initial node set from the axis and node test, and then filtering that node-set by each of the predicates in 
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turn. 

[0038] Some examples of XPath location paths are as follows: 

/doc/c h apte r[2]/secti o n[3] 
5 selects the third section of the second chapter of doc 

chapter[contains(string(title), "Overview"] 

selects the chapter children of the context node that have one or more title children containing the text "Overview" 
child::* [self: : appendix or self:: index] 

selects the appendix and index children of the context node 
~w child: :*[self::chapter or self::appendixj [position()=first()] 

selects the first chapter and appendix children of the context node 
para[@type="warning"] 

selects all para children of the context node that have a type attribute with value "warning" 
para[@id] 

15 selects all the para children of the context node that have an id attribute. 

[0039] XPath operates on the logical structure of an XML document, this logical structure being defined either 
explicitly by a Document Type Definition (DTD), or implicitly by arrangement of tags. For instance, the examples given 
in the previous section assumed an XML document with the structure, in Extended Backus-Naur Form (ENBF) as fol- 
20 lows: 

doc ::= toe chapter+ appendix* index 
chapter ::= section* 
section ::= para+ 

25 appendix ::= section+ . •?' 

where "toe" means "table of contents", "+" means "one or more, "*" means zero or more", and the composite 
description presented above describes, in expanded form, a document comprising a table of contents,. one; or * 
more chapters, zero or more appendices, and an index, where each chapter comprises one or more sections, t . 
30 where each section comprises one or more paragraphs, and finally where each appendix comprises one. or 

more sections. 

[0040] In an XML document, each of these structures is marked by a pair of appropriately named tags. The tag n 
markup allows the logical structure of the document to be determined unambiguously. Hence, any application that 
35 understands the syntax of XML can determine the location of the document's components. Any application that under- 
stands the XPath and XPointer notations can use an URI with an XPath/X Pointer fragment identifier to locate parts of 
the document. 

[0041] Audio-visual, or AV content, is not stored as XML documents and cannot be marked up. However, given an 
unambiguous logical structure, or model, a modified XPath location/addressing method can be used. Hence, for each 
40 class of AV content, in the first instance, an unambiguous logical structure must be defined, and in order to make it 
widely accessible, the logical structure should preferably be published. By an unambiguous logical structure or model, 
it is meant that different persons and applications will segment given content in exactly the same way given the model. 
[0042] Considering one type of AV content, for instance, Digital Video format as used by digital video cameras, this 
can be modelled as: 

45 

dv ::= frame* 

where this means a digital video comprising one or more frames. 

so [0043] In the case, for example, where compatible digital video cameras generate meta-data to represent and 
record the instances the camera starts recording (designated a REC event), a shot can be defined as an interval 
between two REC event. In this case, the model for DV format is: 

dv ::= shot* 

55 

shot ::= frame+ 

meaning a digital video comprising zero or more shots, each shot comprising one or more frames. 
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[0044] As another example, the logical structure of Compact Disc Audio can be modelled as follows: 

cd Audio ::= track* 
track channel channel index* 
5 channel ::= sample* 

meaning an audio CD comprising zero or more tracks, each track comprising two channels, and zero or more 
indices, and each channel comprising zero or more samples. 

w [0045] Considering a more complex example, consider Digital Video Disc, or DVD, video which can provide: 

over 2 hours of high-quality digital video (over Bona double-sided, dual-layer disc), 
up to 8 tracks of digital audio, each with as many as B channels 
up to 32 subtitle/karaoke tracks 
15 up to 9 camera angles (different viewpoints) can be selected during playback 
up to 32 separate subpicture channels 

[0046] Other data types include Video Manager Information files, Video Title Set files, Program Chain Information 
files, still picture Video Objects, attributes for Title, Part_of_Titles, and Menus, Time Map Tables, Part_of_Title Search 

20 Pointers, and Navigation Commands. 

[0047] DVD-Video content is broken into titles and chapters (or parts of titles). Titles are made up of cells linked 
together by one or more program chains (PGC). Individual cells can be used by more than one PGC. Different PGCs 
define different sequences through mostly the same material. Additional material for camera angles and branching is 
interleaved together in small chunks. The DVD player jumps from chunk to chunk, skipping over unused angles or 

25 branches, to stitch together the seamless video. 
[0048] One logical model for DVD-Video is: 

dvdVideo ::= mainMenu? title* subpicture* file* 
mainMenu ::= menu* 
30 menu ::= menu* 

title ::= chapten- 

chapter ::= view+ audio+ subtitle* 
view ::= frame+ 
audio ::= channel* 
35 channel ::= sample+ 

[0049] The previous logical models each relate to a class of AV content, namely digital video, compact disc audio, 
and digital video disk. As noted, the application of the logical models to the associated AV content produces hierarchical 
representations of the AV content which supports addressing of fragments of the content. 
40 [0050] Turning to the aspect of addressing, each location step is evaluated with respect to a context. The context is 
initially the root node, dvdVideo in this case. In general, the context is the results of a prior location step. The node set 
selected by the location step is the node set that results from generating an initial node set from the axis and node test, 
and then filtering that node-set by each of the predicates in turn. 

[0051] Fig. 7 shows a process 1000 for locating audio-visual fragments using the "avptr" addressing scheme. This 
45 process 1000 depicts the decoding sub-process 906 in Fig. 4. A URI reference 1002 (http://www.apxcom.com/prod- 
ucts/dvd0111#avptr(/dvdVideo/title/chapter/audio[1])) is provided, as depicted by an arrow 1004 which is equivalent to 
an arrow 91 6 in Fig. 4, to a step 1 006. In the step 1 006, the URI portion (http://www.apxcom.com/products/dvd01 1 1 ) of 
the URI reference 1002 is used to locate a resource. In a following step 1010, a root of the fragment address, ie dvd 
Video in the URI reference 1002, is used to determine a content type, and consequently, an associated logical model 
so 1012 (which has already been defined in the definition sub-process 902 in Fig. 4). In a following step 1018, the logical 
model 1012 (which is equivalent to the logical model 908 in Fig. 4) is used to set the initial context to the root node of 
the fragment. 

[0052] Thereafter, a decision step 1022 determines whether the aforementioned steps have determined the last 
location step required to locate the desired resource fragment. If the last location step has been determined, the proc- 
55 ess 1000 is directed, according to a "YES" arrow 1024, to a step 1026 in which the located resource fragment is proc- 
essed. The step 1026 is a post-location processing step. 

[0053] If the decision step 1022 determines that the last location step has not been determined, then the process 
1000 is directed, according to a "NO" arrow 1028, to a step 1030 which evaluates the next location step within the cur- 
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rent context. This is described in more detail with reference to Fig. 8. Thereafter, in a step 1034, node(s) are selected 
that satisfy the specified selection criteria (see Fig. 8 for description of these criteria). In a following step 1 038, the con- 
text is set to the node(s) selected, and the process 1 000 is then directed, according to an arrow 1 040, to the testing step 
1022. 

5 [0054] Fig. 8 shows a process 1 100 for evaluating the "evaluation of a location" step 1030, which is shown as a 
dashed box. The dashed box 1030 in Fig. 8 corresponds to the sub-process 1030 in Fig. 7. The step 1 030 is a location 
step of the form axis::node-test[predicates]. In a first step 1102 (which follows the decision step 1022 in Fig. 7 as 
depicted by the arrow 1028), a hierarchical relationship specified by the axis is used to select a set of nodes within the 
current context. Thereafter, in a step 1 1 06 the node test is performed, thereby selecting nodes that are of the specified 

"to node type or node name from the current node set In a following step 1110, the predicates are applied to each node in 
the current node set, selecting only those nodes that satisfy the predicates. 

[0055] The described AV location scheme can, utilising a notation and mechanism similar to those of 
XPath/XPointer, locate analog and digital AV content within a database, or a plurality of databases. 
[0056] A set of named functions are defined for the AV location scheme. For instance, 

15 

time(startTime [,endT1me]) 

For determining whether the current context is within the specified time. 
timecode(startTimecode [, endTimecode]) 

For determining whether the current context is within the time specified by the start and end timecodes. 

20 

[0057] The functions can be used for evaluating expressions, the evaluation always occurring with respect to the 
current context. 

[0058] In addition, new axes can be added, for instance, a time axis and a region axis for locating temporal and spa- 
tial segments of the data. The incorporation of these axes provides additional power to the concept of fragment 
25 addressing, and allows drilling down to different aspects of the AV content. 

[0059] The time axis selects, within the current context, components that occur within the specified start and end 
time. The current context is taken as starting at time zero and progressing continuously through time in normal play 
time. If the end time is not specified, it is taken to be the same as the start time and the component that occurs at or 
closest to the specified start time is selected. 

30 

TimeLocationStep ::= 'time' StartTime (',' EndTime)? 
TimeUnit ::= 'hTmTsVms' 
TimeNotation ::= 'end' I ([0-9]+ TimeUnit) 
StartTime ::= TimeNotation 
35 EndTime ::= TimeNotation 

[0060] For example, 

http://www.apxcom.com/products/dvd01 1 1 #avptr(/dvdVideo/title[2]/time::0m,1 5m) 1 ] 

40 

selects the first 1 5 minutes the second title of the specified DVD. This example is designated example [1 ]. 
[0061] In example [1 ], the method works as follows: 





Selection step 


Meaning 


Context after the selection step 


axis 


(default is child::) 


all the children of the context 
node 


all the children of the dvdVideo 
node, that is, all the mainMenu, 
title, subpicture and file nodes 


node-test and predicate 


Title[2] 


any title node whose position is 
2 


the 2 nd title nodes 


axis 


Time- 


arrange the current selection 
continuously in time starting at 
time zero 


unchanged 
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• 




Selection step 


Meaning 


Context after the selection step 


node-test 


0m,15m 


all content inside the time inter- 
val Omin to 15min 


the first 15 minutes of the 2 nd title 



[0062] Fig. 9 depicts the example [1] as a process 1200 for locating video data on a digital video tape, using the 
"avptr" addressing scheme. A URI reference 1202 (depicted as [1] in the body of the description) is provided, as 
depicted by an arrow 1218, to a step 1204 in which the URI portion (ie http://www.apxcom.com/products/dvd01 1 1) of 

10 the URI reference 1202 is used to locate a resource. In a following step 1206, the root of the fragment address, ie dvd- 
Video in the URI reference 1202, is used to determine a content type, and consequently, an associated logical model 
1208. In a following step 121 0, the logical model 1 208 is used to set the initial context to the root node of the fragment. 
[0063] Thereafter, the process 1200 is directed to a step 1212 which evaluates the location step, ie title[2] in the 
URI reference 1202, thereby selecting the 2 nd title node. Thereafter, in a step 1214, the location step ie time:: Om, 15m 

15 in the URI reference 1 202 is evaluated, thereby selecting content within the first 1 5 minutes of the 2 nd title node. There- 
after, the process 1000 is directed to a step 1216 in which the located resource fragment is processed. The step 1216 
is a post-location processing step. 
[0064] In a variation of example [1 ]: 

20 http://www.apxcom.com/products/dvd01 1 1#avptr(/dvdVideo/title[position()=1 or 

position()=2)][time( n 0mY15m")] [2] 

selects the first 15 minutes of the 1 st and the 2 nd titles of the specified DVD 
[0065] As another example, 

25 

http://www.apxcom.com/products/dvd01 1 1#avptr(/dvdVideo/title [(position 0=1 or 
position()=2]ttime("0m", M 1 5m")] [3] 

selects the first 15 minutes of the first and the second titles of the DVD. 
30 [0066] The method, applied to example [3], works as follows: 



35 




Selection step 


Meaning 


Context after the selection 
step 


40 


axis 


(default is child::) 


all the children of the con- 
text node 


All the children of the dvd 
Video node, that is, all the 
mainMenu, title, subpicture 
and file nodes 




node-test and predicate 


title[position()=1 orposi- 
tion()=2] 


any title node whose posi- 
tion is either 1 or 2 


The first and the second title 
nodes j 


45 


additional predicate 


[timerOmYISm")] 


any content inside the time 
interval Omin to 15min (of 
each candidate node) 


the first 15 minutes of the 
first and the second titles 



[0067] The timecode axis selects, within the current context, components that occur within the specified start and 
end timecode. If the end timecode is not specified, it is taken to be the same as the start timecode and the component 
so that occurs at the specified start timecode is selected. The timecode is represented by a time value or a combination of 
date and time values as defined in International Standards Organisation (ISO) 8601. It can also be an SMPTE (ie Soci- 
ety of Motion Picture and Television Engineers) timecode in the format of HH:MM:SS:FF where FF stands for frame. 



TimecodeLocationStep ::= timecode* '::' StartTimecode ( 7 EndTlmecode)? 
55 TimecodeNotation ::= 'beginTend'lsmpteTimecodeltimel( ,,n dateTime '"') 
StartTimecode ::= TimecodeNotation 
EndTlmecode ::= TimecodeNotation 
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[0068] For example, 

http ://www. apxcom .co m/eve n ts/p rod uctLau nch 99#avptr 
(/dv/shot/timecode::00:1 5:00:00,00:30:00:00) [4] 

5 

selects 15 minutes of clips on the specified digital video tape using SMPTE timecodes 

[0069] The region axis selects, within the current context, the 2D region that is bounded by the specified bounding 
curve. The origin corresponds to the top-left corner of a frame with the x- and y-axis coordinates increasing to the right 
and down. Coordinates are specified in (integral) pixel values. Several types of bounding curves such as rectangle and 
~w ellipse allow the (anti-clockwise) angle between its major axis and the x-axis to be specified. To allow a region to be 
specified for different resolutions of the same content, the resolution of the source from which the bounding curve is 
determined could be specified using the range() function. 

RegionLocationStep ::= 'region* [Range] BoundingCurve 
75 BoundingCurve ::= Shape 

Shape ::= Circle I Ellipse I Rectangle I Polygon I QuadCurve I CubicCurve I Bspline 

Circle ::= 'circle(' XCentre ',' YCentre 7 Radius ')* 

Ellipse ::= 'ellipseC XCentre 7 YCentre 7 Major 7 Minor 7 Angle ')' 

Rectangle ::= 'rect(' Left 7 Top 7 Width 7 Height 7 Angle ')' 
20 Polygon ::= 'polygon? Point 7 Point (7 Point )+ ')' 

QuadCurve ::= 'qcurve(' Point 7 Point (7 Point )+ ')' 

CubicCurve ::= 'ccurve(' Point 7 Point (7 Point )+ ')' 

BSpline ::= tosplinef Point 7 Point (7 Point )+ ')' 

Point ::= Integer 7 Integer 
25 Integer ::= [+[-] Digits 

Angle ::= Degree .. 

Range ::= 'range (' Integer 7 Integer ')' 

[0070] For example, 

30 . - 

http ://www.apxcom .co m/eve n ts/prod uctLa u nch 9 9#avpt r(/dv/s h ot[ 1 ] 
/frame[1 01 2]/region::range(720,480)rect(40,40,60,60,45) [5] 

selects a 60x60 diamond-shape region from the 1 012 th frame of shot one of the specified digital video tape. 

35 [0071] Fig. 10 depicts the example [5] as a process 1300 for locating visual data in a region, using the "avptr" 
addressing scheme. A URI reference 1302 (depicted as [5] in the body of the description) is provided, as depicted by 
an arrow 1322, to a step 1304 in which the URI portion (ie http://www.apxcom.com/events/productLaunch99) of the URI 
reference 1302 is used to locate a resource. In a following step 1306, a root of the fragment address, ie dv in the URI 
reference 1 302, is used to determine a content type, and consequently, an associated logical model 1 308. In a following 

40 step 1310, the logical model 1308 is used to set the initial context to the root node of the fragment. 

[0072] Thereafter, the process 1300 is directed to a step 1312 which evaluates the location step, ie shot[1] in the 
URI reference 1202, thereby selecting the 1 st shot node. Thereafter, in a step 1314, the location stepframe[1012] in the 
URI reference 1302 is evaluated, thereby selecting the node of frame[1012] of the 1 st shot node. Thereafter in a step 
1318, a location step region::range(720,480)rect(40,40,60,60,45) in the URI reference 1302 is evaluated, thereby 

45 selecting content within the specified rectangle of frame[1 012] of the 1 st shot. Thereafter, the process 1300 is directed 
to a step 1320 in which the located resource fragment is processed. The step 1320 is a post-location processing step. 
[0073] As another example, 

http://WAW.apxcom.corrVprodu [6] 

50 

selects the first minute of the second track of an audio CD which has the model 

cdAudio ::= track* 
track ::= channel channel index* 
55 channel ::= sample* 

[0074] The portion of the URI reference before the hash refers to the AV product, in this instance an audio CD no. 
010239, belonging to a class of products, ie audio CDs, produced by the fictitious company "apxcom", which is the 
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resource in this instance. What follows the hash sign is an "AV" fragment identifier, or pointer for locating specific AV 
content on the designated CD. The AV pointer is directed to the internal structure of the content, in this case the first 
minute of the second track. Therefore, the URI (Universal Resource Indicator), which is the familiar entity used in Inter- 
net addressing, when combined with a fragment identifier (the part of the URI reference following the hash sign), is 
5 called a URI reference. 

[0075] The location steps work as follows: 



10 




Selection step 


Meaning 


Context after the selection 
step 




axis 


(default is child::) 


all the children of the context 
node 


all the children of the cdAudio 
node, that is, all the track nodes 


15 


node-test and predicate 


track[2] 


any track node whose position is 
2 


the 2 nd track node 




axis 


(default is child::) 


all the children of the context 
node 


all the children of the 2 nd track 
node 


20 


node-test and predicate 


channel 


any channel node 


all the channel nodes of the 2 nd 
track node 


25 


axis 


time:: 


arrange the current selection 
continuously in time starting at 
time zero 


unchanged 


node-test 


time::0s,60s 


all content inside the time inter- 
val Osec to 60sec 


the first minute of the 2 nd track 



30 [0076] Fig. 1 1 depicts the example [6] as a process 1400 for locating audio data on an audio CD, using the "avptr" 
addressing scheme. A URI reference 1402 (depicted as [6] in the body of the description) is provided, as depicted by 
an arrow 1404, to a step 1406 in which the URI portion (ie http://www.apxcom.com/products/acd010239) of the URI ref- 
erence 1 402 is used to locate a resource. In a following step 1408, a root of the fragment address, ie cdAudio in the URI 
reference 1402, is used to determine a content type, and consequently, an associated logical model 1410. In afollowing 

35 step 1412, the logical model 1410 is used to set the initial context to the root node of the fragment. 

[0077] Thereafter, the process 1400 is directed to a step 1414 which evaluates the location step, ie track[2] in the 
URI reference 1402, thereby selecting the 2 nd track node. Thereafter, in a step 1416, the location step channel in the 
URI reference 1402 is evaluated, thereby selecting all the channel nodes of the 2 nd track node. Thereafter in a step 
1418, a location step time::0s,60s in the URI reference 1402 is evaluated, thereby selecting content within the 1 st 

40 minute of all the channel nodes of the 2 nd track node. Thereafter, the process 1400 is directed to a step 1420 in which 
the located resource fragment is processed. The step 1420 is a post-location processing step. 

[0078] In a further example (no. [7]), the URI to a 1 5 minutes segment of the second movie on a DVD can have the 
form: 

45 http://www.apxcom.com/products/dvd01 1 1#avptr(/dvdVideo/title[2]/time: :30m, 15m) [7] 
while a further example (no. [8]), ie 

http://www.apxcom.com/products/dvd01 1 1 #avptr(/dvdVideo/title/chapter/audio[1 ]) [8] 

50 

will se'ect the first audio track of the DVD. 

[0073] In considering the above URI references, it is again noted that the portion of the URI before the hash sign 
refers to the AV product, namely the resource. What follows the hash sign is an AV fragment identifier for locating parts 
of the AV content. 

55 [0080] Fig. 12 presents a description of a prior art scenario in the context of an XML document 114 presented on 
a browser (not shown). It is noted that the document 114 as shown depicts a physical aspect of the XML document, 
whereas a user of the browser would be presented with the document in a different style (not shown). The document 
114 describes AV content about the Apollo 13 space mission. The document type is designated by a reference 100, 
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which in the present instance is a "Documentary". A number of hyperlink references 104, 106 provide links to other XML 
documents 120, 122, which describe movie sources and movie reviews (not shown) respectively. The movie sources 
referred to by document 120 can be either on-line, or alternatively, can be a physical entities such as a video-cassette 
128 (depicting a specific movie source in this example) produced by a company. The document 1 14 contains a segment 

5 110 named "Rocket Launch" between tag delimiters 108 and 124. The rocket launch segment commences 15 minutes 
after the start of the documentary as indicated by the start index 112. When a user selects the release reference 104, 
the associated link depicted by an arrow 118 directs the user to the XML document 120 describing the movie source 
128 as already noted. Selection of a reference 132 on the document 120 retrieves a UR1 126, which points, as indicated 
by an arrow 130, to the physical video cassette 128. The URI 126 is seen to comprise a link to a company having a 

~io domain name movies designated 134, where the specific cassette is designated "vhs01 1 1" ie 136 in the "products" cat- 
egory 138. Reviewing the aforementioned process for clarity of explanation, selection by the user of the Apollo 13 ref- 
erence 1 04 directs the user to the specific cassette 1 28 through a location process depicted by the URI 1 26. Noting that 
the URI has a standard format and does not incorporate a fragment identifier, it is clear that the described scenario does 
not support drilling down to the fragment level of the non-XML AV resource. The URI does address the particular video 

75 cassette 128, but provides no mechanism for addressing AV data on the cassette at the fragment level. 

[0081] Turning to Fig. 13, a preferred embodiment of the proposed addressing method is described. Selection of 
the preview reference 200 on the main XML document 114 activates a link depicted by an arrow 202 which is directed 
to another XML document 204. This latter document 204 describes preview AV material at the fragment level, and 
selection of a reference 226 results in a link depicted by a dashed arrow 206 pointing to an AV fragment using an 

20 extended URI 208. The portion of the URI 208 before the hash relates to VHS Preview content designated "01 100", 
which is a product of a fictitious company called "movies". The portion of the URI after the hash is an AV pointer 212, 
pointing to the second video 21 6 of the vhs (21 4) tape, and in particular to a segment (21 8) starting 900 seconds after 
the start of the documentary, and ending 1800 seconds after the start of the documentary. 

[0082] The method of addressing an arbitrary fragment of an AV resource is preferably practiced using a conven- 
es tional general-purpose computer system 800, such as that shown in Fig. 14 wherein the processes of Figs. 4 and 7^to 
1 1 may be implemented as software, such as an application program executing within the computer system 800. In par- 
ticuiar, the steps of the method of addressing an arbitrary fragment of an AV resource are effected by instructions in the 
software that are carried out by the computer. The software may be divided into two separate parts; one part for carrying 
out the addressing methods; and another part to manage the user interface between the latter and the user. The soft- 
30 ware may be stored in a computer readable medium, including the storage devices described below, for example. The & 
software is loaded into the computer from the computer readable medium, and then executed by the computer. A com- : g 
puter readable medium having such software or computer program recorded on it is a computer program product. The j% 
use of the computer program product in the computer preferably effects an advantageous apparatus for addressing-an 
arbitrary fragment of an AV resource in accordance with the embodiments of the invention. 
35 [0083] The computer system 800 comprises a computer module 801 , input devices such as a keyboard 802 and 
mouse 803, output devices including a printer 815 and a display device 814. A Modulator-Demodulator (Modem) trans- 
ceiver device 816 is used by the computer module 801 for communicating to and from a communications network 820, 
for example connectable via a telephone line 821 or other functional medium. The modem 816 can be used to obtain 
access to the Internet, and other network systems, such as a Local Area Network (LAN) or a Wide Area Network 
40 (WAN). 

[0084] The computer module 801 typically includes at least one processor unit 805, a memory unit 806, for example 
formed from semiconductor random access memory (RAM) and read only memory (ROM), input/output (I/O) interfaces 
including a video interface 807, and an I/O interface 813 for the keyboard 802 and mouse 803 and optionally a joystick 
(not illustrated), and an interface 808 for the modem 81 6. A storage device 809 is provided and typically includes a hard 

45 disk drive 810 and a floppy disk drive 81 1 . A magnetic tape drive (not illustrated) may also be used. A CD-ROM drive 
812 is typically provided as a non-volatile source of data. The components 805 to 813 of the computer module 801 , typ- 
ically communicate via an interconnected bus 804 and in a manner which results in a conventional mode of operation 
of the computer system 800 known to those in the relevant art. Examples of computers on which the embodiments can 
be practised include IBM-PC's and compatibles, Sun Sparcstations or alike computer systems evolved therefrom. 

so [0085] Typically, the application program of the preferred embodiment is resident on the hard disk drive 810 and 
read and controlled in its execution by the processor 805. Intermediate storage of the program and any data fetched 
from the network 820 may be accomplished using the semiconductor memory 806, possibly in concert with the hard 
disk drive 81 0. In some instances, the application program may be supplied to the user encoded on a CD-ROM or floppy 
disk and read via the corresponding drive 81 2 or 81 1 , or alternatively may be read by the user from the network 820 via 

55 the modem device 81 6. Still further, the software can also be loaded into the computer system 800 from other computer 
readable medium including magnetic tape, a ROM or integrated circuit, a magneto-optical disk, a radio or infra-red 
transmission channel between the computer module 801 and another device, a computer readable card such as a 
PCMCIA card, and the Internet and Intranets including email transmissions and information recorded on websites and 
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the like. The foregoing is merely exemplary of relevant computer readable mediums. Other computer readable mediums 
may be practiced without departing from the scope and spirit of the invention. 

[0086] The method of addressing an arbitrary fragment of an AV resource may alternatively be implemented in ded- 
icated hardware such as one or more integrated circuits performing the functions or sub functions of addressing. Such 
5 dedicated hardware may include graphic processors, digital signal processors, or one or more microprocessors and 
associated memories. The dedicated hardware described can be incorporated into specialised or general purpose 
equipment which addresses AV content. 

[0087] Adoption of XML as a notation for describing the preferred embodiment is a convenient mechanism for 
describing the embodiment. It also allows a consistent view and addressing mechanism for both XML and non-XML 
io resources. As noted previously however, this is not an essential feature of the present invention. 

Industrial Applicability 

[0088] It is apparent from the above that the embodiments of the invention are applicable to the computer and data 
15 processing industries. 

[0089] The foregoing describes only some embodiment of the present invention, and modifications and/or changes 
can be made thereto without departing from the scope and spirit of the invention, the embodiments being illustrative and 
not restrictive. 

20 Claims 

1. A method for addressing an AV fragment of an AV resource over a network, to an arbitrary level of resolution, said 
AV resource being a member of a class of AV resources, wherein a logical model is associated with members of 
the class of AV resources; said method comprising steps of: 

25 

determining a URI network address for the AV resource; 

applying the logical model to the AV resource to form a hierarchical representation of the AV resource including 
a representation of the AV fragment; 

determining a fragment identifier for the fragment dependent upon the representation of the AV fragment; and 
30 combining the URI network address and the fragment identifier to form a URI reference, being an address for 

the AV fragment. 

2. A method according to claim 1 , wherein the determining of the fragment identifier includes a sub-step of: 

35 identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one of (a) the fragment iden- 

tifier, (b) the URI, and (iii) the fragment identifier and the URI. 

3. A method according to claim 1 , wherein the determining of the fragment identifier comprises a sub-step of: 

40 identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon the root of the fragment 

identifier. 

4. A method according to either one of claims 2 and 3, wherein said identifying step comprises a sub-step of: 

45 applying an XPath based addressing scheme to the fragment identifier, said scheme including at least one of 

a time axis, a time function, a region axis, and a region function, for addressing temporal and spatial fragments 
of the AV resource. 

5. A method according to either one of claims 2 and 3, wherein the type of the AV resource includes at least one of (i) 
so digital video, (ii) analog video, (iii) compact disc audio, (iv) analog audio, and (v) digital video disc. 

6. A method for locating an AV fragment of an AV resource over a network, to an arbitrary level of resolution, said AV 
resource being a member of a class of AV resources, wherein a logical model is associated with members of the 
class of AV resources; said method comprising steps of: 

55 

using a URI network address portion of a URI reference to locate the AV resource; 

identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one of (a) the fragment iden- 
tifier, (b) the URI, and (c) the fragment identifier and the URI ; and 
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applying an XPath based addressing scheme to the fragment identifier, said scheme including at least one of 
a time axis, a time function, a region axis, and a region function, for addressing temporal and spatial fragments 
of the AV resource, thereby locating the AV fragment. 



5 7. A method according to claim 6, wherein the identifying step comprises: 



identifying (i) the type of the AV resource, and (ii) the logical model, dependent upon the root of the fragment 
identifier. 



~io 8. A method according to claim 6, wherein the type of the AV resource includes at least one of (i) digital video, (ii) ana- 
log video, (iii) compact disc audio, (iv) analog audio, and (v) digital video disc. 

9. A method for addressing an AV fragment of an AV resource over a network, to an arbitrary level of resolution, said 
AV resource being a member of a class of AV resources, wherein a logical model is associated with members of 

is the class of AV resources; said method comprising steps of: 

determining a URI network address for the AV resource; 

applying the logical model to the AV resource to form a hierarchical representation of the AV resource including 
a representation of the AV fragment; 
20 determining a fragment identifier for the fragment dependent upon the representation of the AV fragment, 

including a sub-step of identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one 
of (a) the fragment identifier, (b) the URI, and (iii) the fragment identifier and the URI; wherein said identifying 
step comprises a sub-step of applying an XPath based addressing scheme to the fragment identifier, said 
scheme including at least one of a time axis, a time function, a region axis, and a region function, for address- 
es ing temporal and spatial fragments of the AV resource; and 

combining the URI network address and the fragment identifier to form a URI reference, being an address for 
the AV fragment £ 

10. An apparatus for addressing an AV fragment of an AV resource over a network, to an arbitrary level of resolution, 
30 said AV resource being a member of a class of AV resources, wherein a logical model is associated with members 

of the class of AV resources; said apparatus comprising: 



first determining means for determining a URI network address for the AV resource; 

applying means for applying the logical model to the AV resource to form a hierarchical representation of the 
35 AV resource including a representation of the AV fragment; 

second determining means for determining a fragment identifier for the AV fragment dependent upon the rep- 
resentation of the AV fragment; and 

combining means for combining the URI network address and the fragment identifier to form a URI reference, 
being an address for the AV fragment. 

40 

11 . An apparatus according to claim 1 0, wherein the second determining means includes: 



identifying means for identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one 
of (a) the fragment identifier, (b) the URI, and (iii) the fragment identifier and the URI. 

45 

12. An apparatus according to claim 1 0, wherein the second determining means includes: 

identifying means for identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon the 
root of the fragment identifier. 

so 

13. An apparatus according to either one of claims 1 1 and 12, wherein said identifying means includes: 

applying means for applying an XPath based addressing scheme to the fragment identifier, said scheme 
including at least one of a time axis, a time function, a region axis, and a region function, for addressing tem- 
55 poral and spatial fragments of the AV resource. 

14. An apparatus according to either one of claims 1 1 and 12, wherein the type of the AV resource comprises at least 
one of (i) digital video, (ii) analog video, (iii) compact disc audio, (iv) analog audio, and (v) digital video disc. 
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15. An apparatus for addressing an AV fragment of an AV resource over a network, to an arbitrary level of resolution, 
said AV resource being a member of a class of AV resources, wherein a logical model is associated with members 
of the class of AV resources; said apparatus comprising: 

5 first determining means for determining a URI network address for the AV resource; 

first applying means for applying the logical model to the AV resource to form a hierarchical representation of 
the AV resource including a representation of the AV fragment; 

second determining means for determining a fragment identifier for the fragment dependent upon the repre- 
sentation of the AV fragment, including identifying means for identifying (i) a type of the AV resource, and (ii) 

10 the logical model, dependent upon one of (a) the fragment identifier, (b) the URI, and (iii) the fragment identifier 

and the URI; wherein said identifying means comprises second applying means for applying an XPath based 
addressing scheme to the fragment identifier, said scheme including at least one of a time axis, a time function, 
a region axis, and a region function, for addressing temporal and spatial fragments of the AV resource; and 
combining means for combining the URI network address and the fragment identifier to form a URI reference, 

75 being an address for the AV fragment. 

16. An apparatus for locating an AV fragment of an AV resource over a network, to an arbitrary level of resolution, said 
AV resource being a member of a class of AV resources, wherein a logical model is associated with members of 
the class of AV resources; said apparatus comprising: 

20 

utilisation means for using a URI network address portion of a URI reference to locate the AV resource; 

identifying means for identifying (i) a type of the AV resource, and (ii) the logical model, dependent upon one 
of (a) the fragment identifier, (b) the URI, and (c) the fragment identifier and the URI ; and 

25 

applying means for applying an XPath based addressing scheme to the fragment identifier, said scheme 
including at least one of a time axis, a time function, a region axis, and a region function, for addressing tem- 
poral and spatial fragments of the AV resource, thereby locating the AV fragment. 

30 17. A computer readable memory medium for storing a program for apparatus for addressing an AV fragment of an AV 
resource over a network to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said program com- 
prising: 

35 code for a first determining step for determining a URI network address for the AV resource; 

code for an applying step for applying the logical model to the AV resource to form a hierarchical representation 
of the AV resource including a representation of the AV fragment; 

code for a second determining step for determining a fragment identifier for the AV fragment dependent upon 
the representation of the AV fragment; and 
40 code for a combining step for combining the URI network address and the fragment identifier to form a URI ref- 

erence. 

18. A computer readable memory medium according to claim 17, wherein the code for the second determining step 
includes: 

45 

code for an identifying step for identifying (i) a type of the AV resource, and (ii) the logical model, dependent 
upon one of (a) the fragment identifier, (b) the URI, and (iii) the fragment identifier and the URI. 

19. A computer readable memory medium according to claiifT17, wherein the code for the second determining step 
so includes: 

code for an identifying step for identifying (i) a type of the AV resource, and (ii) the logical model, dependent 
upon the root of the fragment identifier,. 

55 20. A computer readable memory medium according to either of claims 1 8 and 1 9, wherein said code for the identifying 
step includes: 

code for an applying step for applying an XPath based addressing scheme to the fragment identifier, said 
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scheme including at least one of a time axis, a time function, a region axis, and a region function, for address- 
ing temporal and spatial fragments of the AV resource. 

21. A computer readable memory medium according to 1 8 and 19, wherein the type of the resource comprises at least 
5 one of (i) digital video, (ii) analog video, (iii) compact disc audio, (iv) analog audio, and (v) digital video disc. 

22. A computer readable memory medium for storing a program for apparatus for addressing an AV fragment of an AV 
resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of AV 
resources, wherein a logical model is associated with members of the class of AV resources; said program com- 

~jo prising: 

code for a first determining step for determining a URI network address for the AV resource; 
_code for a first applying step for applying the logical model to the AV resource to form a hierarchical represen- 
tation of the AV resource including a representation of the AV fragment; 

15 code for a second determining step for determining a fragment identifier for the fragment dependent upon the 

representation of the AV fragment, including code for an identifying step for identifying (i) a type of the AV 
resource, and (ii) the logical model, dependent upon one of (a) the fragment identifier, (b) the URI, and (iii) the 
fragment identifier and the URI; wherein said code for the identifying step comprises code for a second apply- 
ing step for applying an XPath based addressing scheme to the fragment identifier, said scheme including at 

20 least one of a time axis, a time function, a region axis, and a region function, for addressing temporal and spa- 

tial fragments of the AV resource; and 

code for a combining step for combining the URI network address and the fragment identifier to form a URI ref- 
erence, being an address for the AV fragment. 

25 23. A computer readable memory medium for storing a program for apparatus for locating an AV fragment of an AV 

resource over a network, to an arbitrary level of resolution, said AV resource being a member of a class of -AV r: 

resources, wherein a logical model is associated with members of the class of AV resources; said progranvcom- 

prising: 

30 code for a utilisation step for using a URI network address portion of a URI reference to locate the AV resource; ^ 

code for an identifying step for identifying (i) a type of the AV resource, and (ii) the logical model, dependent 
upon one of (a) the fragment identifier, (b) the URI, and (c) the fragment identifier and the URI ; and 

35 code for an applying step for applying an XPath based addressing scheme to the fragment identifier, said 

scheme including at least one of a time axis, a time function, a region axis, and a region function, for address- 
ing temporal and spatial fragments of the AV resource, thereby locating the AV fragment. 

24. A signal carrying machine readable instructions for causing a processor to perform a method according to any one 
40 of claims 1 to 9 or causing the processor to perform as an apparatus according to any one of claims 10 to 16. 
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